The impact of single substitutions on multiple sequence alignments.
Abstract
We introduce an alternative view of sequence evolution. In contrast to other approaches, we model the substitution process in two steps. First, we assume (arbitrary) scaled branch lengths on a given phylogenetic tree. Second, we allocate a Poisson-distributed number of substitutions to the branches. The probability of placing a mutation on a branch is proportional to its relative branch length. More importantly, the action of a single mutation on an alignment column is described by a doubly stochastic matrix, the so-called one-step mutation matrix. This matrix leads to analytical formulae for the posterior probability distribution of the number of substitutions for an alignment column.
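The two-step model in the abstract can be illustrated with a small numerical sketch. The abstract does not spell out how the one-step mutation matrix is built, so the code below rests on stated assumptions rather than the authors' construction: a binary alphabet, a hypothetical rooted three-leaf tree (`BRANCHES`), a substitution on a branch flipping the state of every leaf below it, and a constant starting pattern. Under these assumptions the matrix is a convex combination of permutation matrices, and therefore doubly stochastic. The posterior over the number of substitutions is computed by a truncated sum, not by the analytical formulae the paper refers to.

```python
import math
from itertools import product

# Hypothetical rooted tree ((0,1),2): branch name -> (length, leaves below it).
# The names and lengths are illustrative assumptions, not taken from the paper.
BRANCHES = {
    "a": (0.1, {0}),
    "b": (0.2, {1}),
    "c": (0.3, {2}),
    "ab": (0.4, {0, 1}),  # internal branch above leaves 0 and 1
}
N_LEAVES = 3


def relative_lengths(branches):
    """Probability of placing a substitution on each branch."""
    total = sum(length for length, _ in branches.values())
    return {name: length / total for name, (length, _) in branches.items()}


def flip(pattern, leaves):
    """Assumed action of one substitution: flip every leaf below the branch."""
    return tuple(s ^ 1 if i in leaves else s for i, s in enumerate(pattern))


def one_step_matrix(branches, n_leaves):
    """Transition matrix between column patterns after exactly one substitution."""
    patterns = list(product((0, 1), repeat=n_leaves))
    index = {p: i for i, p in enumerate(patterns)}
    probs = relative_lengths(branches)
    size = len(patterns)
    matrix = [[0.0] * size for _ in range(size)]
    for p in patterns:
        for name, (_, leaves) in branches.items():
            matrix[index[p]][index[flip(p, leaves)]] += probs[name]
    return patterns, matrix


def is_doubly_stochastic(matrix, tol=1e-12):
    """Check that every row and every column sums to one."""
    rows_ok = all(abs(sum(row) - 1.0) < tol for row in matrix)
    cols_ok = all(abs(sum(col) - 1.0) < tol for col in zip(*matrix))
    return rows_ok and cols_ok


def posterior_substitution_counts(matrix, start, observed, rate, max_k=50):
    """P(k substitutions | observed pattern) for k = 0..max_k (truncated sum).

    `rate` is the assumed Poisson mean; `start` and `observed` are pattern indices.
    """
    size = len(matrix)
    dist = [0.0] * size
    dist[start] = 1.0  # pattern distribution after 0 substitutions
    weights = []
    for k in range(max_k + 1):
        poisson = math.exp(-rate) * rate ** k / math.factorial(k)
        weights.append(poisson * dist[observed])
        # Advance the pattern distribution by one more substitution.
        dist = [sum(dist[i] * matrix[i][j] for i in range(size)) for j in range(size)]
    total = sum(weights)
    return [w / total for w in weights]


patterns, matrix = one_step_matrix(BRANCHES, N_LEAVES)
```

As a usage example, `posterior_substitution_counts(matrix, patterns.index((0, 0, 0)), patterns.index((1, 1, 0)), rate=1.0)` returns the truncated posterior over substitution counts for a column in which leaves 0 and 1 differ from the starting state. Restricting to binary characters keeps each single-branch action a permutation of the patterns, which is what makes the doubly stochastic property easy to check.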
Related articles
CRASP: a program for analysis of coordinated substitutions in multiple alignments of protein sequences
Recent results suggest that during evolution certain substitutions at protein sites may occur in a coordinated manner due to interactions between amino acid residues. Information on these coordinated substitutions may be useful for analysis of protein structure and function. CRASP is an Internet-available software tool for the detection and analysis of coordinated substitutions in multiple alig...
Gap costs for multiple sequence alignment.
Standard methods for aligning pairs of biological sequences charge for the most common mutations, which are substitutions, deletions and insertions. Because a single mutation may insert or delete several nucleotides, gap costs that are not directly proportional to gap length are usually the most effective. How to extend such gap costs to alignments of three or more sequences is not immediately ...
ETools: Tools to Handle Biological Sequences and Alignments for Evolutionary Studies
Sequences and alignments are the fundamental elements for Bioinformatics and thus a number of tools are provided for retrieval, handle, and analyses. However, for the molecular evolutionary studies, most of them assume human editing of data in the middle of analytical process without providing effective means. In fact, machine-produced multiple alignments are rarely good enough for later analyt...
Maximum Likelihood Phylogenetic Inference is Consistent on Multiple Sequence Alignments, with or without Gaps
We prove that maximum likelihood phylogenetic inference is consistent on gapped multiple sequence alignments (MSAs) as long as substitution rates across each edge are greater than zero, under mild assumptions on the structure of the alignment. Under these assumptions, maximum likelihood will asymptotically recover the tree with edge lengths corresponding to the mean number of substitutions per ...
Predicting functional effect of human missense mutations using PolyPhen-2.
PolyPhen-2 (Polymorphism Phenotyping v2), available as software and via a Web server, predicts the possible impact of amino acid substitutions on the stability and function of human proteins using structural and comparative evolutionary considerations. It performs functional annotation of single-nucleotide polymorphisms (SNPs), maps coding SNPs to gene transcripts, extracts protein sequence ann...
Journal:
- Philosophical transactions of the Royal Society of London. Series B, Biological sciences
Volume 363, Issue 1512
Publication year: 2008